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Abstract 

Many signal processing problems—such as analysis, compression, denoising, and reconstruction—can 
be facilitated by expressing the signal as a linear combination of atoms from a well-chosen dictionary. In 
this paper, we study possible dictionaries for representing the discrete vector one obtains when collecting 
a finite set of uniform samples from a multiband analog signal. By analyzing the spectrum of combined 
time- and multiband-limiting operations in the discrete-time domain, we conclude that the information 
level of the sampled multiband vectors is essentially equal to the time-frequency area. For representing 
these vectors, we consider a dictionary formed by concatenating a collection of modulated Discrete Prolate 
Spheroidal Sequences (DPSS’s). We study the angle between the subspaces spanned by this dictionary 
and an optimal dictionary, and we conclude that the multiband modulated DPSS dictionary—which is 
simple to construct and more flexible than the optimal dictionary in practical applications—is nearly 
optimal for representing multiband sample vectors. We also show that the multiband modulated DPSS 
dictionary not only provides a very high degree of approximation accuracy in an MSE sense for multiband 
sample vectors (using a number of atoms comparable to the information level), but also that it can provide 
high-quality approximations of all sampled sinusoids within the bands of interest. 

Keywords. Multiband signals. Discrete Prolate Spheroidal Sequences, discrete Fourier transform, 
sampling, approximation, signal recovery 
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1 Introduction 

1.1 Signal dictionaries and representations 

Effective techniques for signal processing often rely on meaningful representations that capture the 
structure inherent in the signals of interest. Many signal processing tasks—such as signal denoising, 
recognition, and compression—benefit from having a concise signal representation. Concise signal 
representations are often obtained by (i) constructing a dictionary of elements drawn from the signal 
space, and then {ii) expressing the signal of interest as a linear combination of a small number of atoms 
drawn from the dictionary. 

Throughout this paper, we consider the signal space C^, and we represent a dictionary as an A x L 
matrix which has columns (or atoms) Using this dictionary, a signal x G can 

be represented exactly or approximately as a linear combination of the 'ipi: 

L-l 

X « cx[i]if>i 

i=0 

for some a G C^, whose entries are referred to as coefficients. 

When the coefficients have a small fraction of nonzero values or decay quickly, one can form highly 
accurate and concise approximations of the original signal using just a small number of atoms. In some 
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cases, one can achieve this using a linear approximation that is formed with a prescribed subset of J < L 
atoms: 

a: « (1) 

ien 

where C {0,1,..., L — 1} is a fixed subset of cardinality J. For example, one might use the lowest J 
frequencies to approximate bandlimited signals in a Fourier basis. 

In other cases, it may be beneficial to adaptively choose a set of atoms in order to optimally represent 
each signal. Such a nonlinear approximation can be expressed as 

a: « ^ Q.[i]ipi, 

where f7(a;) C {0,1,..., L — 1} is a particular subset of cardinality J and can change from signal to 
signal. A more thorough discussion of this topic, which is also known as sparse approximation, can 
be found in [14, 15, 29]. Sparse approximations have been widely used for signal denoising [16], signal 
recovery [4] and compressive sensing (CS) [5, 6, 8, 10, 17], an emerging research area that aims to 
break through the Shannon-Nyquist limit for sampling analog signals. A challenge in finding the best 
J-term approximation for a given signal x is to identify which of the (^) subspaces (or, equivalently, 
index sets 17(a;)) to use. This problem has garnered much attention in the applied mathematics and 
signal processing communities, and conditions can be established under which methods based on convex 
optimization [5, 9, 18] and greedy algorithms ]3, 30, 31, 40] provide suitable approximations. 

1.2 Dictionaries for finite-length vectors of sampled analog signals 

In this paper, we study dictionaries for representing the discrete vector one obtains when collecting a 
finite set of uniform samples from a certain type of analog signal. We let x{t) denote a complex-valued 
analog (continuous-time) signal, and for some finite number of samples N and some sampling period 
Ts > 0, we let 

a; = ]x(0) x{Ts) ■ ■ ■ x{{N - 1)T.)]^ (2) 

denote the length-Af vector obtained by uniformly sampling x{t) over the time interval ]0,A^Ts) with 
sampling period Ts- Here T stands for the transpose operator. Our focus is on obtaining a dictionary 
that provides highly accurate approximations of x using as few atoms as possible. 

It is the structure we assume in the analog signal x{t) that motivates the search for a concise 
representation of x. Specihcally, we assume that x{t) obeys a multiband signal model, in which the 
signal’s continuous-time Fourier transform (CTFT) is supported on a small number of narrow bands 
(we assume the bands are known). We describe this model more fully in Section 1.2.2. Before doing so, 
we begin in Section 1.2.1 with a simpler analog signal model for which an efficient dictionary is 
easier to describe. 

1.2.1 Multitone signals 

A multitone analog signal is one that can be expressed as a sum of J complex exponentials of various 
frequencies: 

x{t) = 

i=0 

Suppose such a multitone signal x{t) is bandlimited with bandlimit Hz, i.e., that maxi [Fi] < . 

Let X, as dehned in (2), denote the length-vector obtained by uniformly sampling x{t) over the time 
interval [0, NTs) with sampling period Ts < ^— which meets the Nyquist sampling rate. We can express 
these samples as 

. 1-1 

n = 0,1,..., Af - 1, (3) 

i=0 

where fi = FiTs- This model arises in problems such as radar signal processing with point targets [27] 
and super-resolution [7]. 

In certain cases, an effective dictionary for representing x is the N x N discrete Fourier transform 
(DFT) matrix [2, 41, 27], where '4>i[n] = gt^Trin/iv j — q x _ jy — 1 and n = 0,1,..., A^ — 1. Using 
this dictionary, we can write x = ^a, where a € contains the DFT coefficients of x. When the 
frequencies fi appearing in (3) are all integer multiples of l/N, then a will be J-sparse (meaning that 
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it has at most J nonzero entries), and the sparse strncture of x{t) in the analog domain will directly 
translate into a concise representation for x in C^. This “on grid” multitone signal is sometimes assumed 
for simplicity in the CS literature [41]. However, when the frequencies comprising x{t) are arbitrary, the 
sparse structure in a will be destroyed due to the “DFT leakage” phenomenon. Such a problem can be 
mitigated by applying a windowing fnnction in the sampling system, as in [41], or iteratively using a 
refined dictionary [20]. An alternative is to consider the model (3) directly as in ]19, 39]. However, snch 
approaches cannot be generalized to scenarios in which the analog signals contain several bands, each 
with non-negligible bandwidth. 

1.2.2 Multiband signals 

A more realistic model for a strnctured analog signal is a multiband model, in which x{t) has a CTFT 
supported on a nnion of several narrow bands 

.7-1 

F = U [Fi — Bbandi/2, Fi + 5bandi/2], 
i=0 


i.e., 

x{t) = f X{F)e‘^"^UF. 

Jv 

Here X{F) denotes CTFT of x{t). The band centers are given by the frequencies {Fi}i6[j] S'lid the band 
widths are denoted by {Bbandi}ie[J]: where [J] denotes the set {0,1,..., J — 1}. 

Again we let x, as defined in (2), denote the length-A^ vector obtained by nniformly sampling x{t) 
over the time interval [0, NTs) with sampling period Ts- We assnme Ts is chosen to satisfy the minimum 
Nyquist sampling rate, which means 


Unyq 2maXi^[j] {[Fl ih Hband^/2]} 

Under these assnmptions, the sampled multiband signal x can be expressed as an integral of sampled 
pure tones (i.e., discrete-time sinusoids) 

x[n] = f x{f)e^'^^^”' df, n = 0,1,..., N - 1, (4) 

Jw 

where the digital frequency / is integrated over the union of intervals 


W := Ts¥ = [fo - Wo, fo + lUo] U [/i - Wi, /i -f Wi] U • • • U [/j-i - + Wj-i] C 


1 1 
2 ’ 2 


(5) 


with fi = TsFi and Wi = TsBhE,ndj2 for all i £ [J]. The weighting function x{f) appearing in (4) equals 
the scaled CTFT of x{t), 

and corresponds to the discrete-time Fourier transform (DTFT) of the inhnite sample sequence 
{. ..,*(—2rs), a;(—Ts), a;(0), a;(rs), a;(2rs),... }. (However, we stress that our interest is on the 
finite-length sample vector x and not on this infinite sample sequence.) Such multiband signal models 
arise in problems such as radar signal processing with non-point targets ]1] and mitigation of 
narrowband interference [If, 12]. 

In this paper, we focus on building a dictionary in which hnite-length sample vectors arising from 
multiband analog signals can be well-approximated using a small number of atoms. The DFT basis is 
inefficient for representing these signals because the DFT frequencies comprise only a regular, finite grid 
rather than a continuum of frequencies as appears in (5). Consequently, as previously discussed, any “off 
grid” frequency content in x{t) will spread across the DFT frequencies when the signal is sampled and 
time-limited. 

In the simplified case of a baseband signal model (where J = 1, Fb = 0, and Ts <C —), an efficient 

-Onyq ^ 

alternative to the DFT basis is given by the dictionary of Discrete Prolate Spheroidal Sequences 
(DPSS’s) [37]. DPSS’s are a collection of bandlimited sequences that are most concentrated in time to 
a given index range and the DPSS vectors are the hnite-support sequences (or vectors) whose DTFT is 
most concentrated in a given bandwidth ]37]; we review properties of DPSS’s in Section 2.3. DPSS’s 
provide a highly efficient basis for representing sampled bandlimited signals (when W reduces to a 
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simple band [—VKo,Wo]) and have proved to be usefnl in numerons signal processing applications. For 
instance, extrapolating a signal from a finite set of samples is an important problem with applications 
in remote sensing and other areas [33]. One can apply DPSS’s to hnd the minimnm energy, 
infinite-length bandlimited sequence that extrapolates a given finite vector of samples [37]. Another 
problem involves estimating time-varying channels in wireless commnnication systems. In [42], Zemen 
and Mecklenbrauke showed that expressing the time-varying subcarrier coefficients with a DPSS basis 
yields better estimates than those obtained with a DFT basis, which suffers from frequency leakage. 

By modulating the baseband DPSS vectors to different frequency bands and then concatenating these 
dictionaries, one can construct a new dictionary that provides an efficient representation of sampled 
mnltiband signals. Sejdic et al. [36] proposed one such dictionary to provide a sparse representation for 
fading channels and improve channel estimation accuracy. Zemen et al. [43, 44] ntilized multiband DPSS 
sequences for band-limited prediction and estimation of time-variant channels. In CS, Davenport and 
Wakin [13] studied multiband modulated DPSS dictionaries for recovery of sampled multiband signals, 
and Sejdic et al. [35] applied these dictionaries for the recovery of physiological signals from compressive 
measurements. Ahmad et al. [1] used such dictionaries for mitigating wall clutter in through-the-wall 
radar imaging, and modulated DPSS’s can also be useful for detecting targets behind the wall [45, 46]. 

In most of these works, the dictionary is assembled by partitioning the digital bandwidth ]—|, 5 ] 
uniformly into many bands and constructing a modulated DPSS basis for each band. The key fact that 
makes such a dictionary useful is that hnite-length sample vectors arising from multiband analog 
signals will tend to have a block-sparse representation in this dictionary, where only those bands in the 
dictionary overlapping the frequencies W are utilized. With this block-sparse structure, [13] provided 
theoretical guarantees into the use of this dictionary for sparsely representing sampled multiband 
signals and recovering sampled multiband signals from compressive measurements. For example, using 
a block-based CS reconstruction algorithm, we are guaranteed that most hnite-length sample vectors 
arising from multiband analog signals can be highly accurately recovered from a number of compressive 
measurements that is proportional to the multiband signal’s total spectral support [13, Theorem 5.6]. 
Experiments demonstrate that reconstruction using the multiband modulated DPSS dictionary yields 
superior performance compared to reconstruction with the DFT basis. 

To date, however, relatively little work has focused on providing formal approximation guarantees for 
sampled multiband signals using multiband modulated DPSS dictionaries. To the best of our knowledge, 
an approximation guarantee in a mean-square error (MSE) sense was hrst presented formally in [13]. 
However, the question of how this dictionary compares to an optimal one has not been addressed. The 
objective of this paper is to answer this question and related ones. 


1.3 Contributions 


We study multiband modulated DPSS dictionaries in terms of the subspaces they span on the respective 
bands. More specihcally, let 


e/ := 


gj27r/0 

gi27r/l 


gj 27 r/(iV-l) 


G C^, / € ]- 



denote a length-A^ vector of samples from a discrete-time complex exponential signal with digital 
frequency /. Then, it follows directly from (4) that a multiband sample vector x can be expressed as 

® / *(/)e/ df, ( 6 ) 

Jw 

where W is as defined in (5). We can interpret this equation geometrically: the sampled complex 
exponentials {e/}/g[_i/ 2 ,i/ 2 i comprise a one-dimensional submanifold of C^. The vectors 

AIw := {e/}/6w 

trace out a union of J finite-length curves belonging to this manifold. The sample vector x can be 
expressed as an integral over the vectors in AIw, with weights determined by x{f). 

We are interested in several questions relating to the union of curves A4w: 

• What is its effective dimensionality? That is, what dimensionality of a union of subspaces could 
nearly capture the energy of all signals in A4w, in the £2 metric? 
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• What is a suitable basis for the collective span of this union of subspaces? 

Since we consider £2 approximation error, we will approach the approximation problem via the Karhunen- 
Loeve (KL) transform (also known as principal component analysis (PCA) [26]) [38, 13[. We can imagine 
drawing a vector randomly from Mw with random phase, and we study the covariance structure of this 
random vector. Its covariance matrix is Bjv.w, which has entries 


>N,1 


/[m,n] := / 

Jw 


J 27 rf(m — n) 


df=Y: 


gj 27 r/i(m-n) sju (27rWj (m - n)) 
7r(m — n) 


(7) 


for all m,n £ [A^]. The eigen-decomposition of provides the optimal dictionary for linearly 

approximating this random vector. In particular, the k eigenvectors corresponding to the k largest 
eigenvalues of Bjv,w span the fc-dimensional subspace of that best captures these random vectors in 
terms of MSE; the resulting MSE equals the sum of the N — k smallest eigenvalues. When k can be 
chosen such that this residual sum is indeed small, this indicates that the effective dimensionality 
(informally, the “information level”) of the set A4w is roughly k. 

The first contribution of this paper is to investigate the spectrum of the matrix Bjv.w, which is 
equivalent^ to a composed time- and multiband- limiting operator TmBwI-n defined in Section 2.2. In line 
with analogous results for time-frequency localization in the continuous-time domain [25, 28], we extend 
some of the techniques from [25, 28] for the discrete-time case and show that the number of dominant 
eigenvalues of InBw^n (and hence Bjv.w) is essentially the time-frequency area A]W] = '^^2NWi, 
which also reveals the effective dimensionality of the union of curves AIw- Furthermore, similar to the 
concentration behavior of the DPSS eigenvalues for a single frequency band, we show that the eigenvalues 
of the operator XnB^T’^ have a distinctive behavior: the first « A]W] eigenvalues tend to cluster near 
1 , while the remaining eigenvalues tend to cluster near 0 after a narrow transition, which has width 
proportional to the number of bands times log(A). All of these facts tell us that « A]W] atoms are 
needed in order to accurately approximate, in an MSE sense, discrete-time sinusoids with frequencies 
in W. As indicated in (6), such discrete-time sinusoids are themselves the building blocks of sampled 
multiband signals. 

The second contribution of this paper is to show that the multiband modulated DPSS dictionary is 
approximately the optimal one for representing sampled multiband signals. Specifically, we show that 
there is a near nesting relationship between the subspaces spanned by the true eigenvectors of Biv.w 
and by the multiband modulated DPSS vectors on the bands of interest.^ Directly computing both 
baseband DPSS vectors and the eigenvectors of Bn,w can be difficult, as the clustering of the eigenvalues 
makes the problem ill-conditioned. However, several references such as [21, 37] have pointed out that 
the baseband DPSS’s can also be computed by noting that the corresponding prolate matrix commutes 
with a well-conditioned symmetric tridiagonal matrix. Thus, the multiband modulated DPSS dictionary, 
which merely consists of various modulations of baseband DPSS’s, can be constructed more easily than 
the optimal one (which consists of the eigenvectors of Bjv.w)- 

The third contribution of this paper is to confirm that the multiband modulated DPSS dictionary 
provides a high degree of approximation for all sample vectors e/ of discrete-time sinusoids with 
frequencies / in our bands of interest. We also show that for any continuous-time multiband signal that 
is also approximately time-limited, the resulting finite-length sample vector can be well-approximated 
by the multiband modulated DPSS dictionary. This result serves as a supplement to ]13], which shows 
this approximation guarantee is available for a time-limited signal which has its spectrum concentrated 
in the bands of interest. 

We hope that these results will prove useful in the continued study and application of multiband 
modulated DPSS dictionaries. 

The remainder of this paper is organized as follows. Section 2 defines the time- and multiband-limiting 
operator and provides some important background information on DPSS’s. We state our main results in 
Section 3. We conclude in Section 4 with a final discussion. 

^By equivalent, we mean that Bjv.w® = for any x £ C^. 

^By “bands of interest,” we mean the union of intervals F for continuous-time signals and W for discrete-time signals. We 
assume these bands are known and are used to construct the multiband modulated DPSS dictionary. The results in this paper, 
however, can also have application in the problem of detecting the active bands from a set of possible candidates, as was studied 
in [13]. 
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2 Preliminaries 

2.1 Notation 


Finite-dimensional vectors and matrices are indicated by bold characters. We index all such vectors and 
matrices beginning at 0. The Hermitian transpose of a matrix A is denoted by A^. For any natural 
number N, we let [A'^] denote the set {0,1,..., — 1}. For any k G {1, 2,..., A'^}, let [A\h denote the 

N X k matrix formed by taking the first k columns of A G . In addition, x{N) ~ y{N) means x 

and y are asymptotically equal, that is x{N) = y{N) -1- o{y{N)) = (1 -|- o{l))y{N) as N ^ oo. 


2.2 Time, index, and mnltiband-limiting operators 

To begin, let Bw '■ ^ 2 (Z) —>■ ^ 2 (Z) denote the multiband-limiting operator that bandlimits the DTFT of 
a discrete-time signal to the frequency range W C [—|, |], i.e., for y G f' 2 (^), we have that 

Bwiy)[m]:= f ■ky[m\^ ^ fj/M / ^ df\ , 

JV! V Jw / 


where ★ stands for convolution. In addition, let Tn '■ ^ 2 (Z) —>■ ^ 2 (Z) denote the operator that zeros out 
all entries outside the index range {0,1,..., Af — 1}. That is 




Next, define the index-limiting operator In ■ £2(1) —> as 

lN{y)[m] ■- y[m], m G [N], 

The adjoint operator In ■ —>■ £ 2 ( 1 ) (anti-index-limiting operator) is given by 


rne[N], 

lN{y)[m\ . I otherwise. 


We can observe that Tn = In^n ■ 

Now the time- and multiband-limiting operator BwTn '■ £2(1) —>■ £2(1) is defined by 


B, 


N-l, 

ATN{y))[ra] ^ (y[n\ / 

„_n V 


m 


( 8 ) 


Further composing the time- and multiband-limiting operators, we obtain the linear operator TnBwTn '■ 
£ 2 ( 1 ) —>■ £ 2 ( 1 ) as 


Tn (Bw(Tn (y))) [m] 


Et“o (j/M /w df), m G [N ], 

0 , otherwise. 


(9) 


Similarly, combining the index- and multiband-limiting operators, we obtain the linear operator 
InBwI*n ; as 


i(Bw(lN(y)))[m] = ^ (y[n\ f "'dA , mG [A^]. 

„_n \ «/W / 


( 10 ) 


Suppose y' G £ 2 ( 1 ) is an eigenfunction of TnBwTn with corresponding eigenvalue \': 
TN(Bw(TN(y'))) = \'y'■ We can verify that lN(Bw(lN(^N(y')))) = X'lN(y')- On the other hand, if y" 
and A" satisfy lN{Bw(lN{y''))) = X"y'', then we can conclude that 7iv(f?w('7iv(TAr(y”)))) = X"lN{y")- 
Therefore TnBwTn and InBwIn have the same eigenvalues, and the eigenvectors of InBwIn can be 
obtained by index-limiting the eigenvectors of TnBwTn- 

Note that InBwIn is equivalent to the covariance matrix Bjv,w (see (7)), as a linear operator on C^. 
Thus, in order to answer the questions raised in Section 1.3, we will study the eigenvalue concentration 
behavior of InBwIn- 
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2.3 DPSS bases for sampled bandlimited signals 

In this subsection, we briefly review important definitions and properties of DPSS’s from [13, 37]. 


2.3.1 DPSS’s and DPSS vectors 


Definition 2.1. (DPSS’s [37]) Given W € (0, |) and 77 e N, the Discrete Prolate Spheroidal 
Sequences (DPSS’s) • • • > real-valued discrete-time sequences that satisfy 

B[-w,w]iTN{s^N,w)) = ^n,w^n\w fo'^ 0.11 I € [77], Here ore the eigenvalues of the 

operator B[-w,w]Tn with order 1 > > • • • > > 0. 

The DPSS’s are orthogonal on Z and on {0,1, ..., 77 — 1}, and they are normalized so that 




1, fc = z 

0 , k ^ 1. 


Consequently, it can be shown [37] that = (X^mw) The vector obtained by index-limiting 

s^^w to the index range {0,1,..., — 1} is an eigenvector of the N x N matrix Bn,w with elements 

given by^ 


SN,w[m,n\ := 




j2-!Tf{m-n) 


df = 


sin( 27 rlT (m — n)) 


-w Tr(m-n) 

DPSS’s are useful for constructing a dictionary that efficiently represents index-limited versions of 
sampled bandlimited signals. As pointed out in [13], the index-limited DPSS’s also satisfy 
Ziv(f?[-w.w] (Tat (s^^vv))) “ A^^jyZjv). 


Definition 2.2. (DPSS vectors [37]) Given W € (0, |) and N € N, the DPSS vectors 

• • • ’ ^ ore defined by index-limiting the DPSS’s to the index range {0,1,..., ZV — 1}; 


*iV.W 


lN{s\Jyy) 


and satisfy 


Z]v(f?[-W,W] ^ 


(i) 


x(i) Jl) 

^N,W°N,W 


It follows that Bn,w can be factorized as 


N,w = Sn,WJ^N,W^N,Wj 




where An,w is a,n N x N diagonal matrix whose diagonal elements are the DPSS eigenvalues 
A^^VV’ A^V,..., and Sn,w is a square (N x N) matrix whose Z-th column is the DPSS vector 

^n]w for a-ll Z e [77]. 

The following provides a useful result on the clustering of the eigenvalues A®^y, ..., 

Lemma 2.3. (Clustering of eigenvalues [13, 37[) Suppose that W G (0, |) is fixed. 

1. Fix t G (0,1). Then there exist constants Ci{W,e),C 2 {W,t) (which may depend on W,€) and an 
integer ZVo(lT, e) (which may also depend onW,e) such that 

1 - X%]^ < V Z < [2771T(1 - e)J 

for all 77 > ZVo(W,e). 

2. Fix e G (0, — 1). Then there exist constants C 3 {W, e), C 4 {W, e) (which may depend onW,e) and 

an integer N-i{W,e) (which may also depend on W,e) such that 

X%]w < V Z > [2771^(1-b e)] 

for all N > ZVi(W,e). 


In words, the first « 2NW eigenvalues tend to cluster very close to 1, while the remaining eigenvalues 
tend to cluster very close to 0. As a consequence of this behavior, the effective dimensionality of the 
vectors Al[_w,iv] •= {^f}f€l-w,w] (which trace out a finite-length curve in C^) is essentially 2NW, in 
the sense that we can use a subspace formed by the first « 2NW DPSS vectors to approximate this 
curve with low MSE. 

®For convenience, we use Bpf^w instead of Biv.[-w,w] to denote the matrix which is equivalent to the operator 
XNFi—w,w]T[f This is also the reason that we use Ajv,Wi and sjv,w (which will be defined later) instead of Ajy ^y], 

^N,[-w,w] and S]v,[-w,wl ■ 


7 



2.3.2 DPSS bases for sampled bandpass signals 

Let us now consider the eigenvectors of the operator ir]v(f?[/^_w,/c+vri (^iv)): which can be expressed as: 

/ rfa+w \ 

lN{By^.w,f,+w]{^*N{y)))lm\ = ( / \ * {X*i^{y)[m]) 

\Jfa-w J 

_ „j 2 ,r/,(m-n) sin (27rtL(m - n)) 

^ ® 7r(m - n) ^ 


for all m = 0,1,..., — 1. Let Ef^ denote an Ai x diagonal matrix with entries 


-E/c [m, n] 


m = n, 
0 , m ^ n. 


We can verify that the modulated DPSS vectors ..., satisfy 


^^'(^[/c-vr,/c+vr] (21iv(-E'/^s^^^y))) — Ef^BN,wEf^Ef^s)Jy^ — 


(!) 


v(!) 


„(!) 


That is, Ef^s^^yy) is an eigenpair of the operator (riv(f?y^_w,/^+wi (21jv)) ^ ^ [-^l- 

For any integer k £ {1, 2,..., A^}, let Q := [Ef^SN,w]k denote the N xk matrix formed by taking the 
first k modnlated DPSS vectors. Also let Pq denote the orthogonal projection onto the column space of 
Q. It is shown in [13] that the dictionary Q provides very accurate approximations (in an MSE sense) 
for finite-length sample vectors arising from sampling random bandpass signals. 

Theorem 2.4. ([13] Theorem 4-S) Suppose x is a eontinuous, zero-mean, wide sense stationary random 
process with power spectrum 


P4F) 


1 

Sband ’ 


0, 


Fe[F,-^,F, + ^], 

otherwise. 


Let X = [x(0) xiTs) ... x{{N — l)Ts)]^ £ denote a finite vector of samples acquired from x{t) with 
a sampling interval ofTs < l/(2max{|F’c ± |})- Let fa = FcTs and W = ^band^^s ^ have 

1 rfa+w 1 

E[||*-Pq*||^] =—^ \\ef-PQef\\Uf=^J2^N,w 

Furthermore, for fixed e £ (0, — 1), set k = 2A!'VF(1 -I- e). Then 

E [ll® - Pqx\\1] < 

for all N > Ni{W,e), where Ni{W,e), C3(1F, e), C 4 {W,e) are constants specified in Lemma 2.3. For 
comparison, E [||a:|| 2 ] = ||e /||2 = N. 


3 Main Results 

We now consider the mnltiband case, where 

W = [/o - Wo, fo + Wo] U [/i - Wi, /i + Wi] U • • • U [/j_i - Wj.4,fj-i + Wj_i] C -1,1 

is a union of J intervals as in (5). For each i £ [J], define = [FI/.S’jv.wJfci some value ki £ 
{1, 2,..., N} that we can choose as desired. We constrnct the multiband modulated DPSS dictionary 4' 
by concatenating these subdictionaries: 


^ := [^0 4'j-i]. 


( 12 ) 


In this section, we investigate the efficiency of using ^ to represent discrete-time sinusoids and sampled 
mnltiband signals. 



3.1 Eigenvalues for time- and multiband-limiting operator 

We begin by studying the eigenvalue concentration behavior of the operator XnBw^n (and hence Bn,w), 
which reveals the effective dimensionality of the finite union of curves Afw = 

We first establish the following rough bound, which states that all the eigenvalues of TnBih/X’^ are 
between 0 and 1. 

Lemma 3.1. For any W C [—|, |] and N, the operator InBwXn positive-definite with eigenvalues 


^ ^ ^JV,W — ^iV.W — — ^iV,W 


> 0 


and 


N-l 


^ A«^ = iV|W|. 
1=0 


We denote the corresponding eigenvectors of InBwIn by 


• 1 “iV.W 


Proof. See Appendix A. 


There is, in fact, a sharp transition in the distribution of the eigenvalues of InBwIn- We establish 
this fact in the following theorem. 

Theorem 3.2. Suppose W is a finite union of J pairwise disjoint intervals as defined in (5). For any 
e £ (0, |), the number of eigenvalues of InBviIn that are between e and 1 — e satisfies 




<l-e} <J- 


■ log{N - 1) + 


2 2iV-l 
TT^ N — 1 


e(l-e) 


(13) 


Proof. See Appendix B. 


This result states that the number of eigenvalues in [e, 1 — e] is in the order of log(A^) for any fixed 
£ € (0, |). Along with the following result which states that the number of eigenvalues of InBwIn 
greater than | equals ~ Ai|W|, we conclude that the effective dimensionality of A4w is approximately 
iV|W| 


Theorem 3.3. LetW C [— 5 ] a finite union of J disjoint intervals having the form in (5). Denote 
by 

i_ = ff{n £ Z : —J < n < [ ^ ~ (—- ——)CW} 

itl L 2 J - - L 2 J’ '-V 2N'N 2N’ ' 


and 


= (^ - 2 ^. ^ + 2 ^) H W ^ 0}. 


In particular, it holds that [Ai|W|J — 2J + 2 < < [A’l'WI] + 2J — 2. Then the eigenvalues of the 

operator InBviIn satisfy 


A 


h--i) ^ 1 ^ xh+) 
i 2 — ^Jv,w 


Proof. See Appendix C. 

Note that results similar to the above two theorems for time-frequency localization in the continuous 
domain have been established in [22, 25, 28]. Similar to the ideas used in [22], the key to proving 
Theorem 3.2 is to obtain an upper bound on the distance between the trace of InB^In and the sum 
of the squared eigenvalues of InBwIn- Constructing an appropriate subspace with a carefully selected 
bandlimited sequence for the Weyl-Courant minimax characterization of eigenvalues is the key to proving 
Theorem 3.3. The proof techniques of [25, 28] form the basis of our analysis in Appendix C, but some 
modifications are required to extend their results to the discrete domain. 

Similar to what happens in the single band case (when J = 1; see Lemma 2.3), the eigenvalues of 
InB^In have a distinctive behavior: the first Ai|W| = 2.NWi eigenvalues tend to cluster very close 
to 1, while the remaining eigenvalues tend to cluster very close to 0, after a narrow transition. This is 
captured formally in the following result. 
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Theorem 3.4. Let W C [—|, |] he a fixed finite union of J disjoint intervals having the form in (5). 

1. Fix eG (0,1). Then there exist constants Cl (W,e),C 2 i'Wje) (which may depend on W and e) and 
an integer No(W, e) (which may also depend on W and e) such that 

> 1 - Ci{W, V / < J - 1 + ^[2ArTTi(l - e)J 


for all N > Nq(W, e). 

2. Fix e € (0,— 1). Then there exist constants C 3 (W, e), C' 4 (W, e) (which may depend on W and 
e) and an integer Ni(W,e) (which may also depend onW and e) such that 

V I > ^r2Wi(l + e)l 


for all N > iVi(W, e). 

We point out that iVo(W,e) > max {iVo(Wi, e), V i G [J]}, C 2 {W,e) = v ie[J]} ^ 

C’3(W,e) = Jmax{C3(lTi,£), V i G [J]} and C4(W,e) = mm{C4{Wi,e), V i G [J]}, which will prove 
useful in our analysis below. Here C 2 {Wi,e), C' 3 (Wi,e), and CA{Wi,e) are as specified in Lemma 2.3. 

Proof. See Appendix D. 


3.2 Multiband modulated DPSS dictionaries for sampled multiband 
signals 

Let p G {1, 2,..., N}. Define 

^ — ['^n]w (14) 

where V Z G [N] are the eigenvectors of InBwFn. Let ^ be the multiband modulated DPSS 

dictionary defined in (12). 

There are three main reasons why the dictionary ^ may be useful representing sampled multiband 
signals. First, direct computation of $ is difficult due to the clustering of the eigenvalues of Bn,w. 
However, in the single band case, the matrix Bm,w is known to commute with a symmetric tridiagonal 
matrix that has well-separated eigenvalues, and hence its eigenvectors can be efficiently and stably 
computed [37]. Griinbaum [21] gave a certain condition for a Toeplitz matrix to commute with a 
tridiagonal matrix with a simple spectrum. We can check that the matrix Bjv.w in general does not 
satisfy this condition, except for the case when W consists of only a single interval. However, we 
emphasize that ^ is constructed simply by modulating DPSS’s, which, again, can be computed 
efficiently. 

Second, the multiband modulated DPSS dictionary ^ provides an efficient representation for 
sampled multiband signals. Davenport and Wakin [13] provided theoretical guarantees into the use of 
this dictionary for sparsely representing sampled multiband signals and recovering sampled multiband 
signals from compressive measurements. We extend one of these guarantees in Section 3.2.3. Moreover, 
we confirm that a multiband modulated DPSS dictionary provides a high degree of approximation for 
all discrete-time sinusoids with frequencies in W in Section 3.2.2. 

Third, as indicated by the results in Section 3.1, « ZV]W] dictionary atoms are necessary in order 
to achieve a high degree of approximation for the discrete-time sinusoids in an MSE sense. Our results, 
along with [13], show that the multiband modulated DPSS dictionary ^ with « ZV]W] atoms can indeed 
approximate discrete-time sinusoids with high accuracy. In order to help explain this result, we first 
show that there is a near nesting relationship between the subspaces spanned by the columns of 4' and 
by the columns of the optimal dictionary 

3.2.1 The subspace angle between 5^ and 5# 

In order to compare subspaces of possibly different dimensions, we require the following definition of 
angle between subspaces. 
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Definition 3.5. Let <S* and iS* be the subspaces formed by the columns of the matrices and $ 
respectively. The subspace angle between <S* and <5* is given by 


008 ( 05 ^ 5 .^):= inf ||Pm -</)||2 
<i>e'S.i.,||<;>||2=i 


if dim{S^) > dim{S,^), or 

cos(es.j,s.s.) := I Ji-P*V’l|2 

if dimiS^) < dim{S^). Here P<p (or P^) denotes the orthogonal projection onto the column space of 
(or ^). 

Our first guarantee considers the case where in constructing 4', each ki is chosen slightly smaller 
than 2NWi, and in constructing we take p to be slightly larger than 2NWi. In this case, we can 
guarantee that the subspace angle between 5-j. and 5* is small. 

Theorem 3.6. Let W C [—|, |] be a fixed finite union of J disjoint intervals having the form in (5). 
Pi® e € (0, min{1, 1 ^ — 1}). Let p — ^^\2NWi{\ + ef] and ^ be the N x p matrix defined in (If). Also 
let ki < [2A^Wi(l — e)J, Vi € [J] and be the matrix defined in (12). Then for any column tp in 




(^1 - (7i(W, e)e-C2(w,.)iv _ c^iW, e)e-CA'^,ANy 


Ki{N,W,e) 


and _ 

1 - Ki(Ai, W, e) - N^Ki{N,W,e) - 3N^JCi(W, e)e — 

1 + 3iV\/Ci(W,e)e-^^^^^^ 

if N > max{]Vo(W,e),iVi(W,e)}. Here Ci(W,e) = max{Ci(lTi,e), V i e [J]}, 

C' 2 (W,£) = min{C' 2 (m,£), Vie [J]}, No{W,e), Ni{W,e), C 3 (W,e), ondC 4 (W,e) are the constants 
specified in Theorem 3.4, and Ci(Wi,e) and C 2 {Wi,e) are the constants specified in Lemma 2.3. 


cos{Qs<i,s^) > 


\ 


Proof. See Appendix E. 


We can also guarantee that the subspace angle between <S 4 > and <S* is small if, in constructing 4', 
each ki is chosen slightly larger than 2NWi, and in constructing we take p to be slightly smaller than 
2NWi. This result is established in Corollary 3.8, which follows from Theorem 3.7. 

Theorem 3.7. Let W C [—|, |] he a finite union of J disjoint intervals having the form in (5). Given 
some values ki £ {1, 2,..., N},'ii £ [J], let ^ he the matrix defined in (12). Then 


II 


AO 

''iV,V 


-EE> 


( 0 ) 

N.Wi 


for all I £ {0,1 ,..., Ai — 1}. 


Proof. See Appendix F. 

Corollary 3.8. Let W C [— 5 , |] be a fixed finite union of J disjoint intervals having the form in (5). 
Fix e £ (0, min{l, — 1}). Let p < -J — 1 + l^NWi)! — e)J and $ be the N x p matrix defined in 

(If)- Also let ki = |'2A'Wi(l + e)],Vi £ [J] and be the matrix defined in (12). Then for any column 

(0 ■ 

■“JV.W 

and 

008 (05.^5.^) > \j 1 - 2k 2{N,W, e) + kUN,W, e) - Ni/^{N,W, e) - W, e) (16) 

for all N > max{]Vo(W,e),iVi(W,e)}, where iVo(W, e), iVi (W, e), Ci(W, e), C 2 (W, e), C 3 (W, e) and 
C' 4 (W, e) are constants specified in Theorem S.f, and K 2 {N,W,e) is defined as 
K. 2 iN,W, e) := Ci(W, e)Ai 2 e-^ 2 ('^’VJV NCsiW, e)e-^Vw, 0 JV^ 
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Proof. See Appendix G. 

Although our results hold for scenarios where one dictionary contains 'Y^^y2NWi{l — e)J atoms while 
another one has '^i\2NWi{l + e)] atoms, we note that these dimensions can be made very close by 
choosing e sufficiently small.^ 

3.2.2 Approximation quality for discrete-time sinusoids 

The above results show that spans nearly the same space as in the case where both dictionaries 
contain « AIW] columns. In this section, we investigate the approximation quality of for discrete¬ 
time sinusoids with frequencies in the bands of interest. Then, in the next section, we investigate the 
approximation quality of ^ for sampled multiband signals. 

We first prove that a single band dictionary with slightly more than 2NW baseband DPSS vectors can 
capture almost all of the energy in any sinusoid with a frequency in [—W, W]. Our analysis is based upon 
an expression for the DTFT of the DPSS vectors proposed in [37]. We review this result in Appendix H. 

Theorem 3.9. FixW £ (0, §) and t £ (0,^-1). LetW = | - W, e'= and k ^ 2NW(1 + e). 

Then there exists a constant Cq{W' ,e) (which may depend on W' and e ) such that 

for all N > No{W',e'), where NQ{W',e') and C 2 (lV',e') are constants defined in Lemma 2.3. 

Proof. See Appendix I. 

To the best of our knowledge, this is the first work that rigorously shows that every discrete-time 
sinusoid with a frequency / £ [—W, W] is well-approximated by a DPSS basis [S']v,w]fc with k slightly 
larger than 2NW. This result extends the approximation guarantee in an MSE sense presented in [13]. 
We now extend this result for the multiband modulated DPSS dictionary. 

Corollary 3.10. Let W C [—|, |] 6e a fixed finite union of ,J disjoint intervals having the form in (5). 
Fix e £ (0,— 1). Let ki = 2NWi{l + e),'ii £ [J] and be the matrix defined in (12). Then there exist 
constants (7io(W, e) and C'ii(W, e) (which may depend onW and e) and an integer N 2 (W,e) (which may 
also depend on W and e) such that 

||e/- P 4 ,e/||i < V/e W (17) 


for all N> N 2 {W,e). 
Proof. See Appendix J. 


3.2.3 Approximation quality for sampled multiband signals (statistical analysis) 

As indicated in [13], in a probabilistic sense, most finite-length sample vectors arising from multiband 
analog signals can be well-approximated by the multiband modulated DPSS dictionary. In this final 
section, we generalize the result ]13, Theorem 4.4] to sampled multiband signals where each band has a 
possibly different width. 

Theorem 3.11. Suppose for each i £ [J], xfit) is a continuous-time, zero-mean, wide sense stationary 
random process with power spectrum 


P^.iF) 


E. 

0, 


Fi- 


•^band,- 


<F<Fi-£ 


•^band,- 




otherwise, 


(18) 


and furthermore suppose xo(t), xi(t),..., xj-\{f) are independent and jointly wide sense stationary. Let 
Ts denote a sampling interval chosen to satisfy the minimum Nyquist sampling rate, which means Ts < 

:= 1/ (^2 max ||-Pi ± | , V i £ [>^]|) • Let Xi = [*i(0) xfiTs) ... Xi{{N — l)Ts)]'^ £ denote 

^Though a small e may require N large enough such that our results hold, f 2 JVW*(l-re )1 ratio between the sizes of 

the two dictionaries) may become close to 1. 
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a finite vector of samples acquired from Xi(t) and let x = ^et fi = FiTs and Wi 

Let be the matrix defined in (12) for some given ki. Then 


B, 


2 


E[| |a; 


P^x\ 


< 


EE 


v(i,) 


( 19 ) 


where E[||a:|||] = N. 

Proof. See Appendix K. 

The right hand side of (19) can be made small by choosing ki ~ 2NWi for each i G [J]; recall 
Lemma 2.3. Aside from allowing for different band widths, the above result improves the upper bound 
of [13, Theorem 4.4] by a factor of J. 

Finally, the following result establishes a deterministic guarantee for the approximation of sampled 
multiband signals using a multiband modulated DPSS dictionary with « AijWj atoms. 

Corollary 3.12. Suppose x is a continuous-time signal with Fourier transform X{F) supported on 

E= U [Fi — Bl,andj2,Fi Bl,andi/2], i.C., 
i=0 

x{t) = f X{F)e^^"^UF. 

Jw 

Let X = [a;(0) x{Ts) ... x{{N — l)ra)]^ G denote a finite vector of samples acquired from x{t) 

with a sampling interval ofTs < 1/(2max{|F/± j}). Let Wi — TgBrandi(2, fi = TgFi for all i G [J], 

and W = “ Wi,fi + Wi]. Fix e G (0, — 1). Let ki — 2A'Wi(l + e),Vi G [J] and let be the 

matrix defined in (12). Then 

\\x-P<s.x\\l< |*(/)|^ • C'io(W,e)iV®/^e"'^“^'^’'^^ (20) 

for all N > N 2 (W, e), where N 2 (W, e), C'io(W, e) and C'ii(W, e) are constants specified in Corollary 3.10. 


Proof. See Appendix L. 


Corollary 3.12 can be applied in various settings: 

• The sequence x[n] encountered in most practical problems has hnite energy. For example, if we 
assume that \x{f)\^df < 1, we conclude that jja; — Pj<a;||| < Cio(W, 

• Moreover, in some practical problems, the finite-energy sequence x\n\ may be approximately time- 
limited to the index range n = 0,1 ,...,A'—1 such that for some 5, \\x\\% = j |Tjv(a ;)||2 > (1~^)II®I|2- 
In this case, (20) guarantees that 


\x-P^x\\l ^ /y |a;(/)pd/ 


1 — 6 


where the last inequality follows from Parseval’s theorem that ||a ;||2 = /y \x{f)\^df. 

Along with the result proved in [13] that samples from a time-limited sequence which is 
approximately bandlimited to the bands of interest can be well-approximated by the multiband 
modulated DPSS dictionary, we conclude that the multiband modulated DPSS dictionary is useful for 
most practical problems involving representing sampled multiband signals. 

However, we point out that not all sampled multiband signals can be well-approximated by the 
multiband modulated DPSS dictionary. To illustrate this, consider the simple case where W reduces 
to a single band [—W, W]. Recalling that the infinite-length DPSS’s are strictly bandlimited, it follows 
that each of the DPSS vectors can be obtained by sampling and time-limiting some strictly bandlimited 
analog signal. Nevertheless, for all I > k, we will have 


AO 


-P 


AO 


[Sjv.wlfc ®iV,wN2 


AO 


= 1 


( 22 ) 
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even when we choose k = 2NW{1 + e). In this case, the approximation guarantee in (22) is much worse 
than what appears in (21). Such examples are pathological, however: the infinite sequence ^ has 
energy | Hi = w)~^^ which according to Lemma 2.3 is exponentially large when I > 2NW (1 + e), 

and yet the energy of the sampled vector is only 1. Moreover, the spectrum of the infinite 

sequence is entirely concentrated in the band [—W, W] while the spectrum of the time-limited 

sequence is almost entirely contained outside the band [—W)IL], and so on. We refer the 

reader to [13] for additional discussion of this topic. 


4 Conclusions 

In this paper, we have provided a thorough analysis of the spectrum of a time- and multiband-limiting 
operator in the discrete-time domain. We have showed that the information level of finite-length 
multiband sample vectors is essentially equal to their time-frequency area, which also indicates the 
number of dictionary atoms required in order to obtain a high-quality approximation. We have also 
considered the angle between the subspaces spanned by the eigenfunctions of the time- and 
multiband-limiting operator and by the multiband modulated DPSS dictionary. Our results show that 
the multiband modulated DPSS dictionary is nearly optimal in terms of representing finite-length 
vectors arising from sampling multiband analog signals. 

We have showed that the multiband modulated DPSS dictionary can not only guarantee a very high 
degree of approximation accuracy in an MSE sense for finite-length multiband sample vectors, but also 
that it can guarantee such accuracy uniformly over all discrete-time sinusoids in the bands of interest. 
Though we are not guaranteed such accuracy uniformly over all sampled multiband signals, we have 
suggested that such accuracy holds for most practical problems involving multiband signals. Thus, our 
work supports the growing evidence that multiband modulated DPSS dictionaries can be useful for 
engineering applications. 
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A Proof of Lemma 3.1 

Proof. Let y ^ C^, y 7 ^ 0 be an arbitrary vector. Then 


iV-l /N-l 


{lN{Bw{TN{y))),y) = '^TNiBv/iT*Niy)))lm]y[m] = L] ( "^dfy[n] \ y[m] 

m—0 m—0 \n —0 ^ / 

/ Vn=0 / dw 


> 0 , 


where y is the complex-conjugate of the vector y, is the DTFT of X’^{y), and the last 

inequality is derived from the fact that compactly supported signals cannot have perfectly fiat magnitude 
response. 


By Parsevel’s Theorem, we know I = ||i/||i. Therefore 


{iN{B^{x*N{y))),y} 


N-l 


I E y[n]e-^"^^"\ 

n=0 


< llyllt 


Thus, we have 


0 < min 

yec^ 


{lN{Bw(X’^{y))),y) ,(0 


— ^N,W — 


< (Tiv(gw(Tj)f(y))),y) .. 

nia.x. 1 . 11 o ^ 

yec^ \\y\\l 


for all I €[N]. 
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By noting that XnBw^n is equivalent to Bn,w, we have 


N-l N-1 N-1 . 

y~^ = trace(BAr,w) = iJ]v,w[n, n] = ^ / = iV|W|. □ 


B Proof of Theorem 3.2 


Proof. First we state a useful inequality about the Frobenius norm of positive semi-definite matrices. 
Suppose X e Y e are two arbitrary positive semi-definite matrices. Then 

IIX 4- Y\\l = trace((X -f Yf{X + Y)'j 

= \\X\\% + \\Y\\l -b 2trace(A:"r) 


where the last inequality is derived from the fact that trace(X^y) is nonnegative, which can be showed 
as follows. By the hypothesis that X and Y are positive semi-definite matrices, we have the factorization 
X^ = X = X^^^X^^^, where X^^^ is also a positive semi-definite matrix.® Then we conclude that 
trace(X"Y) = tiace{X^^^X^^'^Y) = trace{X^^^YX^^^) > 0, since X^^'^YX^^^ is also a positive 
semi-definite matrix. 

We next bound the Frobenius norm of BM,Wi by 


\Bn,w,\\ 1 = N{2Wif + EE 

m^n 
N-1 


( sin ( 27 vWi(m — n)) 


\ 7r(m — n) 

= + 2 ^g(iV - n) y 

n^l ^ 

N-1 / . /o TT7 \ \ 2 N-1 

=4v»f( ™‘;„ ’ ) -2E 

n=l ^ n=l 


/sin(27rfFin) 


= ANWi +2NlWi- 2Wi - E ( 


/ sin (27rFFin) 


Tvn 

2\ N-1 


-2E 


n = l 


sin (27rWin) 


1 


1 


>4:NWi+2N{W,-2Wi-^ -rdx -2^ 


TT'" 


/iV-1 


1 


— dx + 1 
X 


= 2NWi - 4 - 4 log(^ - 1)- 


where the fourth line follows from Parseval’s theorem J2'^=-oo = f^w- ~ 2W4, which 

indicates that E“ i = W. - 2W^ 

Now applying the above results yields 


.7-1 

||-Biv,w||F = II Ef-BN,WiEf.\\%^ 

j-i 

> ^ ll-BiY.lvJlF 

^Note that X has the eigen-decomposition X = VDV^ where V is an orthonormal matrix and £) is a diagonal matrix 
whose diagonal elements are non-negative, giving the square root X^/^ = 
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where the second line follows since Ef.BN,WiEf. is positive semi-definite. Recalling the result stated in 
Lemma 3.1 that ~ trace(i3]v,w) = A''|W|, we get 

^ = trace(Biv.w) - |iBiv.w||l < J ^ - 1)) • 

1^0 ^ ^ 

Thus, equation (13) follows by noting that for any e G (0, |) one has 

N-l 

1=0 c, ^ \ (0 I 

C Proof of Theorem 3.3 


Proof. A precise proof of a similar result for time- and band-limiting operators in the continuous domain 
was first given in [28]. Izu and Lakey [25] extend the result to multiple intervals in the frequency domain 
or time domain. Their work forms the foundation of the following analysis. 

As we have noted, the two operators 7ivf?wT/v and have the same eigenvalues. We work 

with TnBwTn to prove Theorem 3.3. For convenience, we also use ..., to denote the 

decreasing eigenvalues for the operator TnBwTn- We let S([Ai]) denote the subspace of all finite-energy 
sequences supported only on the index set [A^[, that is 

S{[N\) = {y.y€t2{'L).TN{y) = y}. 


First, for all integers I € [N], the Weyl-Courant minimax representation of the eigenvalues can be 
stated as 


( mine msY « / s r, <Tiv(Bw(Tiv(^))),'y) 

''iV.W - S ■ (Tjv(B„(Tiv(y))),y) 

I niaxs,_|_j mmyg£2(z),t/6Sigi t 

f mine msY (Tv(^w(Tv(y))).y) 

^ J mmsj maXygs([;v]),aXS( (y^y^ : 

^ 1 min „ . (Tn (B„(Tn( y))),y) (23) 

[ ™ax5,_|_j mmyg5([jvi),i/eSi+i > 

_ I mins, maxygs([]v]).axs, 

1 maxs,gi min„gg([,v]),^6S,+i ’ 

where Si is an /-dimensional subspace of £ 2 (Z), and y{f) is the DTFT of the sequence y. Noting that all 
the eigenvectors of TnBwTn belong to S([Ai]), we restrict to y e >S'([A']) in the second line. 

Lemma C.l. Consider the bandlimited sequence g € f 2 (Z) whose DTFT is given by 

, , r ^cos(iV^/)e-^^-^Lfl, |/|<^, 

I 0, ^<|/|<i ^ ^ 


Then \\g\\l = 1 and g[n] > for all n G [Ai]. 


Proof (of Lemma C.l). First it is easy to check that ||p ||2 = /^i \g{f)\^df = 1. Then computing the 

~ 2 

inverse DTFT directly yields 


sM 




i\ 1 . 

5 j + VW““ 



Let £,{t) = sinc(t — |) + sinc(t -I- |). Taking the directive of ^{t), we would find on [—|, |] that ^(t) 

_ I iY. I 

achieves its minimum value of 1 at the points t = ±|. Therefore, g[n] > since | | < | for all 

n e [Aij. □ 
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C.l Upper bound 

From equation (23), we know that 


\(0 

= mm max 
Si yesam),yJ-Si 


U\y{f)\"df 


Therefore, in order to bound the eigenvalues from above, it suffices to pick an appropriate Z-dimensional 
subspace Si C f' 2 (Z) and then find a uniform upper bound for the quantity above for all time-limited 
sequences y € S')[A'']) orthogonal to Si. 

Consider the bandlimited sequence g € defined in (24). Let £fg : £ 2 )^) —>■ ^ 2 )^) denote a 

modulating operator with £fo{y)\n] := e^'^'^^°'^y[n] for all n £ Z and /o £ |]- Set 

1 ^ 0 } 


Lj- = |n^€Z: — — < n < - , (-,- 1 -) fl ^ 

+ 1 L 2 j - - L 2 2N'N 2N’ 


and hence t+ = #L+- Let Si,^ be the t+-dimensional subspace of f' 2 (Z) spanned by the functions 
£^g,n' £ L+, that is, 

TV 

Si^ := span • 

If the time-limited sequence y £ S([Af]) is orthogonal to Si_,_, then 

0 = {y,£^g) = {y,g{- - ^)) = =■ gy[n], n £ L+, 

where g ■— g* is the complex-conjugate of the sequence g and g is the DTFT of g. 

Now it follows that 

I N-l I 

|2 


L^J 

\9vW]? = ^ \9y{n]\^ 

'i'=-Lf J 


n'eL^ 


E i/J 


' + 1/2 



- 1/2 y(f')9i'^ - f)df\^ 


m)fdf 


(25) 


= \\y\\l-f mf)\ 

J few 


■df, 


where is defined as := {n' £ Z : — ["yj < n' < n' ^ the second line holds 

because g is bandlimited to the third line follows from the Cauchy-Schwarz inequality, and 

the fourth line holds because ||p ||2 = 1 and by construction, the set ^ covers 

the intervals W completely. On the other hand, let y Qg denote the pointwise product between y and 
g, that is {y © g)\n] = y\n\g[n]. Note that y Qg has the same support in time as y, namely [N], and 
[yj < n' < LJ} forms an orthobasis (normalized DFT basis) for C^. We can rewrite 

QyW] = {y Qg) I which can be viewed as the DFT oi y Qg. Therefore, using Parseval’s theorem, we 

XT 

acquire 

1^1 

111 ii2 


> 


L^J 

l5!/Kll^ = ^lly©; 

since by hypothesis, g[n] > for all n £ [N]. Now, combining the above lower bound on the energy 
of the sequence gy and the upper bound in (25), we observe that 

1 , 


< 


/ I 

J fGW 


\yif)\ df, 


and therefore. 


^ iv,w — ||-,||2 — 9 * 
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C.2 Lower bound 

In the other direction, consider the minimax representation 


x(0 

^iV,W 


max 
Si + i 


/wly(/)P4f 


In order to find a lower bound for the eigenvalues, it suffices to pick an appropriate {I + l)-diniensional 
subspace Si+i C £ 2 (Z) and then find a uniform lower bound for the quantity above for all time-limited 
sequences y € S'([A^]) inside Si+i. With g as defined in (24), let the time-limited sequence h € i 2 {[N]) 
be such that h[n] = l/g[n] for all n £ [A^]. We set 


r r I n, \ ^ \ ^ ^ \ N ~ ^ \ /"n-' 1 

- {n £ : — \ _n _ L 3 -I’ ^N~ 


C 


and hence i- = Let Si,_ be the (.--dimensional subspace of i 2 {Z) spanned by the functions 

£^h,n' £ L-, that is, 

N 

S,_ := span ^ . 

Suppose y £ (and hence y £ £ 2 ([A''])). Then we may write 


y = b„>£^h 

^^ N 

n' 


for some coefficients &„/. Moreover, 

y®9= 

^^ N 

n' 

Noting that , — [yj < n' < forms an orthobasis for C^, we obtain 

N-l N-l 

\bn'\^ = N\\yQg\\l = AT ^ l?/M ©flMI^ - 2 ^ ^ 

n'^L_ n=0 n=0 

since by definition, g[n] > -y= for all n G [N"]. On the other hand, 

N-l , 

bn' = Y = {y,£^g). 

^^ N 

n=0 

Now using the same procedure as in (25), one has 

Y = Y Ky>^id5)l^ 

^^^ N 

n'GL_ n'^L_ 



where the last line holds since by construction, the set U„/gi. ^ + ^1 ^ subset of the intervals 

W. Altogether, we then conclude that for any y £ Si,_ (and hence y £ 5'([A'])), 

gWvWl < [ \y{f)fdf. 

^ J few 

And hence 
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D Proof of Theorem 3.4 


D.l Proof of eigenvalues that cluster near zero 

Proof. Since Bjv.w = Ef. , according to [24] (see pp. 181), the following holds 


x (0 

^iV,W 


J-l 




(h) 

N,Wi 




for all li e [A^], i e [J] and I — J2i=o ^ [-^l- 

Fix e e (0, — 1). For each i G [J], let Ni{Wi, e), C 3 {Wi,e) and C 4 ,{Wi, e) be the constants specihed 

in Lemma 2.3 with respect to Wi and e. If we let A'^i(W, e) = max{A'^i(Wi, e), V i € [^]}, then we have 

, Vh> [2Wi(l + e)l,ie [J] 


for all N > Ni{W, e). Hence, by choosing k > \2NWi{l + e)], V i £ [J], we have 


x (0 

''n,v/ 


i=0 




for all N > Ni(W,e) and I > Y:i\‘2NW^{l + e)l,where C' 3 (W,e) = J max {C 3 (W*, e), V i £ [J]} and 
C 4 (W,e) = min{C' 4 (IFi,£), Vi£ [J]}. □ 


D.2 e-pseudo eigenvalue and eigenvectors 

Definition D.l. (e-pseudo eigenvalue and eigenvector [34]) Let X £ be any matrix and u £ 

he any vector with unit h-norm. Given e > 0, the number A £ C and vector u £ are an e-pseudo 
eigenpair of X if the following condition is satisfied: 

\\{X - XI)u\\l < e. 

Lemma D.2. Suppose W is a fixed finite union of J pairwise disjoint intervals as defined in (5). Fix 
e £ (0,1). For each i £ [J], let No{Wi, e) be the constant specified in Lemma 2.3 with respect to Wi and e 
and let No{W, e) = max{Afo(VFi, e), V i £ [J]}. Then for all k < 2NWi{l — e),i £ [J] and N > No{W, e), 
(^n'w ’ ) e-pseudo eigenpair ofTNBw/Lff with e < 2Ci{Wi,e)e~^^^^'’^'*^, or in detail 

where of''’ = irjv(Hw\[/i-Wi,/i+Wi] and||of’)||i < 2Ci{Wi, . Here W \ 

[fi — VVi, fi + Wi] = U [fit — Wp, fi' + Wi'] means the set difference between W and ]fi — Wi, fi + Wi], 

i' 

and C'i(Hi,e) and C' 2 (Hi,e) are the constants specified in Lemma 2.3 corresponding to Wi and e for all 
iG [J]. 


Proof (of Lemma D.2). According to the definition of the operator 

E V^ J 27 r/., (m-n) Sin(27rIFi/(m — n)) j2ir/in (li) r i 


n=0 i'=0 


j2,Tfim,(li) (li) r ■ 


7 v{m — n) 

N-l J-l 

+ E E ®' 

11=0 


j 27 r /^/ (m — n) Sin(27rl4^j/(m ^)) j2'nj\n (li) 


7r(m — n) 

^'iv.Wi^^v.Wi L- 21iv(Hw\[/i-Wi,/i+Wi] jN{Bfis(f)^,)))[m]. 


’N,Wi 


[n] 


_gj2ir/im ,^(4) ^(li) 
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In what follows, we will bound the energy of = irjv(Z?w\[/i-Wi,/i+iVi]))) as 



1 ITjv (Hw\[/i 

-Wi,fi + Wi] 

(IfflEf.s 

(h) 

N,W 

.)))lli 


< 


,fi + Wi]{DN 


.))l 

2 

2 


< 


-Wi,fi+Wi 

{TN{EfF 



= 

Ii4:kii2- 


fi + Wi]{'^N 

(Ej 

gih) 

i°N,W 

))ll2 

< 

Ii4:l.ii2- 

l|2:iv(%.- 

-WiJi + Wi 


{Ef,s 

v:lv.)))ii 

< 


^ < 1 - (1 

-Ci(IUi, 

e)e 

-C2(W, 

,e)iV^2 


for all li < y2NWi{l — e)J, i G [J] and N > No{W, e). Here the second inequality in the sixth line follows 
simply from Lemma 2.3 since No(W,e) > No{Wi,e). □ 

Using this result, we now show the first « A^|W| eigenvalues of XnBw^n s^re close to 1. 

D.3 Proof of eigenvalues that cluster near one 

The main idea is to guarantee that the sum of the hrst « N |W| eigenvalues is sufficiently close A'^IW]. 
Then we conclude that the first « A^|W| eigenvalues cluster near one by applying the fact that the 
eigenvalues are upper bounded by 1. First we state the following useful results. 

Lemma D.3. ([13] Lemma 5.1) Fix e € (0,1). Let ki = \2NWi{\ — e)J, V i G [J], and let be the 

dietionary as defined in (12). Then for any pair of distinct columns tfi and ip 2 in we have 

nz Cocw.s) „ 

KV’i,V’ 2 )| <3\/Ci(W,e)e-(26) 

and 

II TJ II “ C2(W,e)]V 

< 1+ 3AiJc'i(W,£)e-5- 

il II 2 ^ 

if N > iVo(W,e), where di{W,e) = maxe), V i G [J]} and 
C' 2 (W,£) = mm{C 2 {Wi,e), V i G [J]}. Here ||^^^'||2 is the spectral norm (or largest singular value) of 

Lemma D.4. ([24]) Let X G be a Hermitian matrix, and let Xo{X), Xi{X),..., Xn-i{X) be its 

eigenvalues arranged in decreasing order. Then, 

XfiX) + Xi{X) F ... + Xr-i{X) = max trace(?7"Xf7), 

UeC">^’-,UV!U = Ir 

where Ir is the r x r identity matrix with 1 < r < N. 

Based on this result, we propose the following generalized result concerning the sum of the first r 
eigenvalues. 

Lemma D.5. Let X G ([;;JVxjv ^ positive-semidefinite (PSD) matrix, and let 

Xo{X), Xi{X),..., Xn-i{X) be its eigenvalues arranged in decreasing order. Then, for any matrix 
M G 1 < r < A^, the following inequality holds 

Xo{X) + Xi{X) + ... + Xr-i{X) > trace(M"XM)/||M"M|| 2 . 

Proof (of Lemma D.5). Let ao{M ),..., ar-i{M) denote the decreasing singular values of the matrix 
M. Denote M = as the truncated SVD of M, where Sr is an r x r diagonal matrix with 

cto(AL), ... ,ar-i{M) along its diagonal. 

Now applying Lemma D.4, we obtain 

r—1 

^Ai(X) > trace(f7r"Xf7r) 

1^0 

> trace(SrI7r"XI7rSr)/(cro(M))^ 

= trace(yrSrI7r"XI7rErK")/||M"M||2 
= trace(M"XM)/||M"M|| 2 , 


20 




where the first line follows directly from Lemma D.4, the second line is obtained because XUr is 
PSD and hence its main diagonal elements are non-negative, and the third line follows because Vr is an 
orthobasis and = \\M^M\\ 2 - □ 

We are now ready to prove the main part. Fix e £ (0,1). Let ki = YlNWiil — e)J,Vf £ [J], and let 
be the dictionary as defined in (12). We have 




E 


\(0 


> trace 




^j -1 L2ivm,(i-£)J \ 


^2 = 0 ^ 1=0 

( j -1 L2ivm,(i-£)J 

E E 

2 = 0 li=0 




/.7-1 L2vm,(i-£)j \ 

a E E (41,-lloEll.) /||*«*| 


^ 2 = 0 li=0 

Y^J-1 sr^l2NWi(l-€)] 

Z^ 2=0 2 ^L =0 




C2 (W,e)iV 


l-b3iVJC'i(W, e)e—^ 


> 


(eE' - CiiW, e)e-S2{w..)v _ Ci{W 

1 -b 3Aiy^Ci(W,£)e- ''"*"’'^" 

C2(W,€) 

J + J2, L2A^V^".(l - e)J - SNCsjW, e)e - 


Co (W,e)N 

l + 3A^<E5(W,e)e—- 


C 2 (W,e) 


C2(W,e)Ar 


Co (W, e)iV 

1 + 3 iVC' 5 (W,£)e—Es- 


l- 3 iVC' 5 (W,£)e-5 

C2(W,5)N 


E’2{y 

J + J2,[2NW,{1 - e)\ -3NC5iW,e)e -J 

1 - 3NC5{W,e)e 

„ CoCW.e) / C 2 (W,e) „ 

J + J2i[‘2NW^{l - e)\ - QN^C5iW,e)e --b ( 3 ArC' 5 (W, e)e- 

1 - (3NC5{W,e)e-^^^^^ 


> J-b^L2iVWi(l -e)J -6N'^C5{W,e)e~^^^^ 


for all N > max{iVo(W,e),Ai'(W,e)}, where N'{W,e) = max{(^j(|^)^, log( 3 C' 5 (W, e))} is the 

C2(W,e)Ar , ^ 

constant such that 3NC5(W,e)e 2 <1 for all N > N {W,e). Here the first line follows directly 

from Lemma D.5, the second line follows because trace ,w^) = trace (j^Lo Bn ,w^i^ and 

Bn,w is equivalent to TnBwI-n, the third line follows from Lemma D.2, the fourth line follows from the 
Cauchy-Schwarz inequality which indicates that ^ Il 2 ||ofII 2 = llof’^lh, 

the fifth line follows from Lemmas 2.3, D.2 and D.3, the seventh line follows by setting C' 5 (W, e) = 

maxICi (W, e), \JC\ (W, e)}, the ninth line follows because J -b Ei Y2NW (1 — e)J < N, and the last line 

C2(W,c)iV 

follows because by assumption 3NC5{W,e)e 2 < 1 . 


C2(VI,e)N 


®This can be verified as 37VC5(W,e)e- 2 - = 3C’5(W, e)e“"^— 2 - 1 ^) < 3C5(W,e)e“ 

^ log(3C5(W, e))}. Here the first inequality follows because ^ 


-N 


C 2 W,e) 


max{(^^ (w,e) 

( _4_n2 

'■ C2(W,e) > 


< 1 for all N 
^ for all N 
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By noting that 0 < < A^^^^ < 1 from Lemma 3.1, we acquire 


x(0 _ 

^Af,W ^ 


> 


./-1+Ei l^NWiil- 

e)J 

E 

x(i') 

2'JV,W 

l'=0 


'j-l+Ei l2NWi{l- 

e)1 

E 

x(i') 

2'JV,W 

l'=0 


- QN^CsiW, e)e 

C2(W,«) 

2 


./-l + EiL2JVWi(l-e)J 

E 


>(i') 


- [j-l + Y,l‘2NWi{l-e)\ 


for alH < J — 1 + X]i \^NWi{\ — e)J, where the second line follows by setting A^ L yf Z to 1. Fix W and 

_ C2(Vf,e)N 

t. It is always possible to find a constant N such that SNCslW, e)e ^ <1 for all N > N . Now, 

and ]Vo(W,e) = max{iVo(W, e), AT'}. 


for convenience, we set C'i(W, e) = 6 C' 5 (W, e), C 2 ( 
This completes the proof of Theorem 3.4. □ 


^ ^ C2(W,6) 


E Proof of Theorem 3.6 

Proof. First denote the eigen-decomposition of Bjv.w as 

Bn,w = Lfjv.wAjv.wLfjv.W) 

where An,w is an N x N diagonal matrix whose diagonal elements are the eigenvalues 
wi w> • • • > Un,w is a square {N x N) matrix defined by 

TT r (0) (1) (JV-l)i 

■— ['*^iv,w '*^]v,w • • • '*^iv,w 1 - 

Also let a = be the coefficients of ip represented by Un,vi- 

Fix e € (0,min{l, -j^ — 1}). Suppose %p is a column of for some particular i £ [J]. Now from 
Lemma D.2, we have 

Bjv.wV’ = 

for some k < L 2 AiVFi(l — e)J. 

Plugging the eigen-decomposition of the matrix Un,w into the above equation, we require 

AiVjWU' — ’ 

where The elementary form of the above equation is 

A^w«H = A^’^.afm] + of'^[m] 

for all m £ [A]. 

Now we have 

iV-l iV-l 

m—Ei r2A'’VPi(l + e)] m=Y^^\2NWi{l + e)'\ “^iV,W 


(27) 


for all N > max{Ao(W, e), Ai(W, e)}, where the second line follows by bounding the A^’^. term using 1 — 
C'i(IFi, (which is not less than 1 — Ci(W, e)e“^ 2 {w,e)]v^ from Lemma 2.3 and bounding the 


< 


— 1 '“'f l-i) [ 1 


< 


< 


(^1 - C'i(W,£)e-C2(w,e)iv _ C'3(W,e)e-c'4(w,6)iv^ 

__ 

(^1 - Ci(W,£)e-C2(w,.)iv _^3('^^^)g-C4(w,.)iv^ 

('l - Ci(W,£)e-^2(w,.)Jv _^3('^^g)g-C4(w,.)iv\ 
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terms using Theorem 3.4, and the fourth line follows because ||of‘^|p < 2C'i(VTi,e)e < 

The following general result will help in extending (27) to an angle between the subspaces. 


Lemma E.l. Let Su o-nd Sv be the subspaces spanned by the columns of the matrices U £ and 

V £ respectively. Here r < q < N. Suppose each column of V is normalized so that ||v !||2 = 1 

and is close to Su such that for some 5i, ||n! — i-’c/'i’iHi < <5i for all I £ [r]. Furthermore, suppose the 
columns of V are approximately orthogonal to each other such that for some 82 , |(Dfc,Di)| < 82 for all 
k ^ 1. Then we have 


COs(©Su‘Sv ) — 


1 — 5i — (^2 + v'^) 

1 + IV 82 


Proof (of Lemma E.l). Any n £ Sv can be written as a linear combination of u; intheformn = 'Y^^aiVi. 
We first bound the I 2 norm of v by 


r—1 

hWl = W^aiViWl 

Z-0 

r—1 




+ E E {aiVi,akVk} 

1=0 1=0 k=0,kj^l 

r—1 r —1 r—1 

<Ei“'i^+E E i“'ii«fci^2 

1=0 1=0 k= 0 ,k^l 

r—1 r —1 r—1 , ,0 , ,0 

sEw’+E E t'”' fc 


1=0 
/ r—1 


1=0 k=0,k^l 

= (^E (1 + (^ - 1 )^ 2 ) < (1 + ^^ 2 ), 

where the third line follows from the hypothesis that \{vk, i>!)| < <52 for all fc yf L Similarly, 


\Puv\\l = II (aiVi) \\l 

1=0 

r —1 r —1 r —1 

= E + E E {aiPuvi,akPuVk} 

1=0 1=0 k=0,k^l 

r —1 r—1 r —1 

= ^|a,niPr/^,||i + E E {aivi, ak {vk - {vk - PuVk))) 

1=0 1=0 k=0,k^l 

r —1 r —1 r—1 


>^|a;r(l-5i)-^ ^ \a,\\ak \(82 + V^) 

'=0 1=0 k=0.k^l 

/r—1 \ /r—1 

X; l«il" (l - - (r - 1) (^2 + > X] \ai\^ (l - 5i - AT (^2 + ^)) 


where the fourth line follows because {vi,Vk — PuVk) < ||t’!|| 2 ||r’fe — PuVk \\2 < and |(i;fc,i;i)| < 82 
for all fc yf L 

Therefore, for any non-zero vector v £ S\r we have 


WPuvWi . 
Ii«ll? - 


1 — (5i — Ai ((52 + \/(^) 

1 -|- NS 2 


□ 


Finally, (15) follows from Lemma E.l by replacing U with $ and V with and assigning (5i with 
the upper bound in (27) and 82 with the upper bound in (26). □ 
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F Proof of Theorem 3.7 


Proof. For each i G [J], define = [Ef^SN,Wi\J^N,Wi]ki for some given ki £ {1, 2,..., A^}. We 
construct the scaled multiband modulated DPSS matrix ^ by^ 


^ := [^0 (28) 

The main idea is to bound using j|^ . In order to use this argument, we first 

give out some useful results. 

Lemma F.l. Suppose is the matrix defined in (28) with some given ki £ {1, 2,..., Ai}, Vi £ [J]. Then 




2<1- 


Proof (of Lemma F.l) Let y ^ C^. Then 

J-l ki-l 

\2 


1=0 li=0 

J-lki-1 - - 

= E E V ) {Ei. sj 4 - , y) 

i=0 li=0 
J-l ki-l 

- c(h) x(h) (Jk) 

~ / . / . y ^Si^N,Wi^N,Wi\^N,Wi) ^fiV 
i=0 li=0 
J-l N-1 

i=0 li=0 

J-l J-l 

= Y,V^ Ej,Ir,{BwJlN{Elv))) = Y.‘^lN{BwjrN{Elv))),Elv) 


2 = 0 
J-l 


i=0 

J-l 


= Y^{Bwjr^{Elv))XN{Elv)) = Y.{Bwjr^{Elv)),Bwjr^{Ef^v))) = E \\BwJTl,{Elv))\\l 


i=0 

•r-1 „f-+W: 
i=o 


^1/2 

\v{f)?df = ^ ^[fi-Wi,fi+Wi) if )) \yif)\"df, 


where the fourth line follows because = \\\/E"y \\2 > 0, 

the fifth line follows because 5 DEo ~ 2iiv(I?Wi(Ijv(®))), and we use y{f) = 

y[n]e~^^^^" as the DTFT of Xfqijj) in the last three equations. 

Noting that J2iZo ^lfi-Wi,Wi+fi)if) < 1 for all / G |] since we assume there is no overlap 
between each interval [J — Wi,Wi + fi), we conclude 

Il^%ll 2 < r" \y{f)fdf = \\y\\l 
J-1/2 


and 


1^112 < 1. 


□ 


Lemma F.2. For any ki £ {1, 2,..., N}, i G [J], let and be the matrices defined in (12) and (28) 
respectively. Then for any y £ , 


\\P^y \\2 > ^“y\\ 2 . 


(29) 


^Hogan and Lakey [23] considered the scaled and shifted Prolate Spheroidal Wave Factions (PSWF’s) and provided conditions 
on a shift parameter such that the scaled and shifted PSWF’s form a frame or a Riesz basis for the Paley-Wiener space. 
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Proof (of Lemma F.2) Let be a reduced SVD of 4', where both L% and V% are 

orthonormal matrices of the proper dimension, and is a diagonal matrix whose diagonal elements 
are the non-zero singular values of We have 

\\^^" v \\2 = \\ U ^% U ^ v \\2 
< \\u^y\\2 
= \\u^u^v\\2 
= \\P^V\\2 

where the second lines follows because Ill'll 2 < 1 and hence the diagonal elements S-gr are bounded above 
by 1, and the fourth line follows because each column in is in also and hence \\P.,v\\ 2 ^\\Pu^y\\ 2 . 
□ 


Now we turn to prove Theorem 3.7. By (29), we observe that 


\P<ffU 


> 11^ ^ u 


H (1) 

n I '■ > 


N,V 


,7-1 fci-1 

x(h) fjh) \HjpH (i) II 

i=0 !i=0 

J-1 JV-1 

-MR P =(h) x(li) (Ah) \H „H (I) II 


,(0 


i=0 li=ki 
j_l N -1 


^ 11-^iV,W'^jV,wl 12 -EEii 

i=0 li = ki 

j-l N-1 

>4!w-E E ° 


G Proof of Corollary 3.8 

Proof. It follows from Theorem 3.7 that 

J_1 iv-l 

> - E E ^ 




(h) 

N,W 2 


i=0 L=ki 


i=0 l^=ki 

_ J-l N-\ _ 


7—0 

> 1 - Ci(w, - iVC3(W, 

for all N > max{A^o(W, e), Ni(W, e)}, where the second line follows by bounding the term using 

Theorem 3.4 and by bounding the A^*^. terms using Lemma 2.3, and the third line follows because 
C 3 (W,e) = Jmax{C' 3 (Wi,e), V i G [J[} and C 4 (W,e) = min{C' 4 (W"i,e), V i G [J]}. 

Let K 2 {N,W,e) = + NC 3 {W,e)e-^*^'”’^^’^. Then \\u%\^ - Pxs,u^^\j\l < 

2 K,2{N,W,e) — K, 2 {N,W,e). Noting also that ’’^w.w) = 0 S'!! ^ ^ I, (16) follows directly from 

Lemma E.l. □ 


H DTFT of DPSS vectors 

The results presented in this appendix are useful in Appendix I, where we analyze the performance of 
the DPSS vectors for representing sampled pure tones inside the band of interest. Let ^Nwif) denote 
the DTFT of the sequence 7iv(s^'^vy), i.e., Figure 1 shows J) 
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/ 


I 1) M 

Figure 1: Illustration of\s\/^{f)\ , or the energy in {e/} captured by each DPSS vector. The horizontal axis stands 
for the digital frequency f, which ranges over [—|, |], while the vertical axis stands for the index I £ [A^]. The l-th 

I , , |2 

horizontal line shows lOlogjQ Sjv iv(/) • Here N — 1024 and W = 


for all I £ [A'"] with N — 1024 and IF = We observe that the first « 2NW DPSS vectors have 
their spectrum mostly concentrated in [—IF, IF], only a small fraction of DPSS vectors whose indices 
are near 2NW have a relatively flat spectrum over [—|, |], and the remaining DPSS vectors have their 
spectrum mostly concentrated outside of the band [—IF, IF]. This phenomenon is captured formally in 
the asymptotic expressions for and from [37]. 

Lemma H.l. ([37]) Fix IF £ (0, |) and e £ (0,1). Let a — 1 — j4 = 1 — cos 27 rlF. 

1. For fixed I, as N ^ oo, we have 


and 


Here 


1-A' 


(0 


F(: 


2\/2a 


^N.wif) 


C 3 / 4 (/), W <\f \ < arccos(yl — N ^^^)/27r, 
csfsif), arccos(A - N~^^'^)/2n < \ f\ < 1/2. 


= (Z!)-l/2^1/42(l«+15)/8^(2i+3)/S^(2i + l)/4^2 _ Q)-(*+F2)/2g-iJV^ 

C3 = 7vH^2-^/^a-H^[2 - = 0 (nH^)cs, 

2y^ , 


7 = log(l + 



N 


\JA — cos (27rf)'j 


cos (f arcsin(6>(/)) + |(Z + |) arcsin ((/(/)) + (Z - A^)f + 
((A - cos (27r/))(l - cos {2'Kf)))H^ 
ait\ _ Q + 2cos(27r/) _ (2 - 3 q) - (2 + a) cos (27r/) 

^\ J ) r \ _ ’ /O f \\ ’ 

2 —a (2 — Q;)(l — cos (iTr/j) 

where Jo is the Bessel function of the first kind. 

2. As N ^ oo and with I = [2A''1F(1 — e')J for any e £ (0, e], we have 


1-A 


(0 

N.W 


27rI/2 ^do 
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and 


Here 


~{Z) 

5 


N,W 


if) 


d4g5{f), W < I/I < arccos(A — A'" ^)f2'K, 
degelf), arccos(A - iV“^)/27r < |/| < 1/2. 


d4 = (L2)-'/"^(1 - 2l2)-l/4g-CL4/4g-iVL3/2^1/2^ 


55(/) = Jo I nJ (cos(2^/) - 


I .r /■B-cos(27rt),_ /"!/" 

goif) = R{f) cosIttN I J - - TT^^dt + — ' 


dt 


f y A — cos(27rl) 2 Jf ^/(-B “ cos(27rl)) {A — cos(27ri)) 

B(/) = |(B-cos(27r/))(^-cos(27r/))|“^/'‘, ^ mod (t^“^^ 0 i’ 


+ 6 » 


'= mod-L 5 - Lq.I'k 

4 2 4 ° 


ii = 


B(e) = 


r p(c)rfc, B2 = r Qiodi, B3 = Piodi, u = g(c)dc, is = r u 

JB JB Ja Ja J-1 


C-B 


(e-^)(l-e) 


1/2 


, Q(o = l ( c - B )( e - A )( i -^' 


2M-1/2 


where B is determined so that A)(f ~ mod {y,2Tv) returns the remainder 


after division of y by 2 ti. 


I Proof of Theorem 3.9 

Noting that Sn,w forms an orthobasis for , the main idea is to show that the DPSS vectors 

^ have their spectrum most concentrated outside of the band 

[-W,W]. 

Since the sequence is exactly bandlimited to the frequency range |/| < W, we know that its 

DTFT s^N,wif ) •= 5I]^-tx> vanishes for all IF < |/| < By noting that the first « 2NW 

DPSS’s are also approximately time-limited to the index range n = 0,1,..., — 1, we may expect that 

'^N,wif) •= is also approximately 0 for all IP < |/| < | and I < 2NW (1 — e). This 

illustrates informally why the DTFT of the first « 2NW DPSS vectors is concentrated inside the band 
[—IP, IP]. By employing the antisymmetric property [37] which states that |s^^i 4 ^(/)| = “/)l i 

we then have that the DPSS vectors • • ■ ’ ®'''® almost orthogonal to 

any sinusoid with frequency inside the band [—IP, TP]. 

Recall that is the DTFT of the sequence TNis%]w), i.e., s^^\^[n]e~P^-^". 

We have 

for all I € [N"]. As we have observed in Figure 1, the spectrum of the first « 2JVW DPSS vectors is 
approximately concentrated on the frequency interval [—TP, TP]. This behavior is captured formally in 
the following results. 

Corollary I.l. Let A = cos27rTP. For fixed TP £ (0, |) and e £ (0,min(2P — 1,1)), there exists a 
constant C6(TP, e) (which may depend on TP and e) such that 

l4!w(/)l < C6(TP,e)iV3/"e-^^^^, TP < |/| < 1/2 

for all N > A^o(TP,e) and I < 2NW{l — e). Here C 2 {W,e) and No{N,e) are constants specified in Lemma 
2.3. 


L2, 
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Proof (of Corollary I.l). 

The main approach is to bound W < \f\ < 1/2 with the expressions presented in 

Lemma H.l. Suppose e £ (0,1) is fixed. 

1. For fixed I and large N: 

In order to quantify the decay rate of we exploit some results concerning of / 4 (/) from [32] 

and / 5 (/) as follows; 

|Jo(a:)| < 1, V a; > 0, (30) 

and for any < j/j < 1/2, one may verify that 


If ( fM _ - _ 

((A - cos (27r/))(l - cos (27r/)))^/'‘ 

“ ((A -{A- Ar-3/2)) (1 - (^ - Ar-3/2)))i/4 

< _ ^ 

((]V-3/2))(JV-3/2)))1/4 


where the last line follows because 1 — ^4 > 0. 

Recall that cs = (2 — and cs ~ ^ 2-\f2a ^1 — ■ Plugging these 

into Lemma H.l and utilizing Lemma 2.3, we get the exponential decay of ^r{f)\, \f \ > IF as 


C'/(IF,e)lV^''2e-^^, IF < j/j < arccos ("h - /2n, 

C'^(IF,e)Ar3/\-^^, /27r < j/j < 1/2, 


for fixed Z and A^ > No{W,e)- Here C'/(IF,e) = ^ 1 / 221/4 _ q,)-i/4 ^Ci{W, e), 
Cs(W,e) = (2\/^C'i(IF, and A^o(IF, e), C\{W,e) and C' 2 (IF, e) are constants as specified in 

Lemma 2.3. 

2. For large Ai and ? = L2AriF(l - e')J, V e' £ (0, ej: 

Note that is a decreasing function of B and /j ^ 2W tv > A tv. 

Hence 1 > H > H. Now we have 


|56(/)| < |R(/)| < 


< 


{A-cos{2Tvf)Y/^ ~ {A-{A- ~ 


< AiH2 


for all arccos( 2 l — N ^)/2tv < j/j < 1/2. 


x(<) 

N,W 
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Recall that |p 5 (/)| <= 1 from (30), d 4 = 7r^/^(l — A^) ^1^2 and 

Plugging these into Lemma H.l and utilizing the bound on in Lemma 2.3, we get the 

exponential decay of |^^vv(/)l’ I/I > IF as 


^N]wif)\ < 


£2 ; 


C^'{W,e)N^/^e-A 


£2 ; 


C^{W,e)N^^^e-A 


IF < I/I < arccos[H — N ^]/27r, 
arccos[ 2 l — N'“^]/27r < j/j < 1/2, 


for all I = [2AriF(l - e')\, V e' £ (0, e] and N > No{W,t). Here C'^'(IF,e) = ^Ci(IF, e)/27r, 
C'/'(IF,e) = 2-1(1 - H2)-i/4yCA(H^, and No{W,t), Ci(IF,e) and C' 2 (IF,e) are constants as 
specified in Lemma 2.3. 


Set 

^(IF, e) = max {^/(IF, e), ^^(IF, e), ^/(IF, e), ^^(IF, e)} = max IttI/" 
This completes the proof of Corollary 1.1. □ 


2-a 


1/4 


,2-1(1-H''; 



\/C7i(1F,£). 
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Lemma 1.2. ([37]) For fixed W G (0, |) and e G (0, - 1), s%\^{f ) and (/) satisfy 


?{0 


■(/)l 


^ l~{iv-i-o.l _ 


for alll> 2NW{l + t). 

Now we can conclude that (e/,s®;^) decays exponentially in for all / > 2NW{l + e) and |/| < W 
by combining the above results. 

Corollary 1.3. Fix W G (0, i) and e G (0, ^ - 1). Let W = ^ - W and e' = Then 

Ke/,4y)| = |4V(/)| < V|/| < W 

for all N > No{W',e') and all I > 2NW{1 + e). Flere, C 2 {W',e') and No{W',e') are constants specified 
in Lemma 2.3 with respect to W' and e , and C6(M^^e^) is the constant specified in Corollary 1.1 with 
respect to W' and t . 


Proof of Corollary 1.3. Let I' = N — 1 — 1. For all I > 2NW{1 + e), we have 

, 1 W 

l' = N-l-l<N- 2NW{1 + e) = 2N{-- W){1 - ^ —e). 

2 2 “ ^ 

Let W' = I — VF and e = £ (0,1). It follows from from Corollary I.l and Lemma 1.2 that 


Ke/, 


4!w)l = Kei p4'U)l < Ce{W)e’)N^'\- 


C^CW'.e') 


V I/I < IF 


for all Ai > Aio(IF',e')- □ 


Recall that C6(IF',e') = max|xi /2 2 -i(i _ 42 ^-i/ 4 | .yc'i(W',e') with A = cos(2xIF) and 

a = 1 — A. As IF gets closer to 0 or the variable (1 — becomes larger, and we have 

(1 — —>■ l/-\/2xIF as IF —>■ 0. Also we have (1-)^^^ —>■ Ij-s/'xW as IF —>■ 0. Therefore, for any 

non-negligible bandwidth which is the main assumption in this paper, the variable 
max 5 2“^(1 — Ci(IF', e') would not be too large. 

Now, for hxed IF G (0, |) and e G (0, ^ — 1), we have 


|e/- P[s„,„,];,e/||i = ^ 

1=2NW(1+€) 

N-1 

1=2NW(1+€) 

< CsiW' 


for all I/I < IF and AT > Aro(IF',£'), where C9(IF',e') = C|(IF',e')- □ 


J Proof of Corollary 3.10 

Proof. Suppose / G [fi — Wi, fi + Wi] for some particular i G [J]. LetCio(W,£) = max{C 9 (IF/, e'), Vi G 
[J]} and Cii(W, e) = min{C 2 (IF/, e'), Vi G [J]}. It follows from Theorem 3.9 that 

II®/ “ P'I'®/I|2 ^ ||g/ — P[E/^ Sjv,lxj 2 JVWj(l + e) ®/I I 2 

^ ll®/-/i “ ■^[SjV,Wil2iVWi(l + e) ®/“A II 2 

for all N > No{W[, e). We complete the proof by setting A^ 2 (W, e) = max{A^o(IF', e),'ii G [J]}. □ 
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K Proof of Theorem 3.11 


Proof. Since ajo, , a3j_i are independent and zero-mean, we have 

N-l N-1 _ _ N-lJ-1 


E[||a:||^| = 


] = ® [I®MH = ^\xi\n]x'^\n] = ^ [|a;i[n]|2] = t = AT. 


n = 0 n=0 0<r,i'<J —1 

Applying Theorem 2.4, we acquire 


n=0 i=0 


E 


- Pi 


[E/. SN.wJfc 


- — V A*') 

- IWI 


Note that the power spectrum Pxi{F) assumed in (18) results in the constant instead of 


Now, we have 


E [lla: — P’^ccILl = E 


= E 


Y^xi- Xi] 


i=0 

J-1 


= E 


{xi—p^xi 


i=0 

J-1 


{xi - p^x^)^ Y (Xi — P'^Xi 


i=0 
J-1 J-1 


= E I Y, ~ P^XiWl + “ P^Xi)^ {Xi> — P^Xi 

i=0 i=0 i' =0J' 

J-1 J-1 

Y. ^ [11®^ “ P^XiWl] + y]] y]] ^ “ P^x^)^ {xi^ - P^a;i/)j 


i=0 

J-1 


i=0 

J-1 J 


= ^ E [llaji - Ppaiillj] + ^ Y, E j^ccf a:;/- a;f Ppaji/j 

i=0 i=0 

J-1 J-1 

= ;^E[||a;i-P*a:,||^] <;^E 

i=0 i=0 

,7-1 N-l 

^ IWI ^ 


\xi - P\ 


\^fi ^N,Wi]k^ *■ 


i = 0 l = ki 


where the equality in the sixth line follows because E [a;^a;;] = (E \xii])^ (E [a:;]) = 0 and E [a:^ Pij,xi\ = 
(E [xii])^ (E [P,i,a;i]) = 0 for all i',i £ 7 ^ i, and the inequality in the sixth line follows because the 

column space of [P/. Sjv.WiJfe; inside the column space of for all i £ [J]. □ 

L Proof of Corollary 3.12 

Proof. It is useful to express the sampled bandpass signal x as 

x = [ x{f)efdf, (31) 

aw 

where we recall that x{f) denotes the DTFT of x[n], which is the infinite-length sequence that one 
obtains by uniformly sampling x{t) with sampling rate Ts- 
Now it follows from (31) that 


lla: - P*a:|L = 


< 


< 


[ x{f)efdf- [ x{f)P-i,efdf 

Jw Jw 

[ *(/)(e/ - P<i>ef)df 
Jw 2 

f mf)fdf-f \\ef - P-^efWUf 

Jw Jw 

f mf)fdf ■ C7io(W, 

aw 


:)]V 


30 



where the third line follows from the Cauchy-Schwarz inequality and the last line follows from (17) and 
the fact that |je/ - P^-e/llid/ < |W| sup^-g^ ||e/ - P-^e/||i < sup^-g^ ||e/ - P»e/||i. □ 
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